Sample Pages to Be Followed Exactly in Preparing Scripts Generalization of Reinforcement Learning with Cmac

نویسندگان

  • Sunggyu Kwon
  • Kwang Y. Lee
چکیده

To implement a generalization of value functions in Adaptive Search Element (ASE)-reinforcement learning, CMAC is integrated into ASE controller. ASEreinforcement learning scheme is briefly studied to discuss how CMAC is integrated into ASE controller. Neighbourhood Sequential Training concept is utilized to establish the look-up table of CMAC and to produce discrete control outputs. In computer simulation, an ASE controller and a couple of ASE-CMAC neural network are trained to balance the inverted pendulum on a cart. The number of trials until the controllers are established and the learning performance of the controllers are evaluated to find that generalization ability of the CMAC improves the speed of the ASE-reinforcement learning enough to realize the cartpole control system. Copyright© 2005 IFAC

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Two Novel Learning Algorithms for CMAC Neural Network Based on Changeable Learning Rate

Cerebellar Model Articulation Controller Neural Network is a computational model of cerebellum which acts as a lookup table. The advantages of CMAC are fast learning convergence, and capability of mapping nonlinear functions due to its local generalization of weight updating, single structure and easy processing. In the training phase, the disadvantage of some CMAC models is unstable phenomenon...

متن کامل

Sample Pages to Be Followed Exactly in Preparing Scripts Adaptive Control of a Coupled Drives Apparatus Using Dual Youla-kucera Parametrization

An adaptive algorithm based on the dual Youla-Kucera parametrization is introduced enabling simple closed-loop identification and adaptation of a class of symmetric MIMO systems. The methodology exploits the algebraic approach to control system design. Necessary conditions for usage of the developed method are discussed and results are presented for the case of coupled drives control. Copyright...

متن کامل

A Q-learning with Selective Generalization Capability and its Application to Layout Planning of Chemical Plants

Under environments that the criteria to achieve a certain objective is unknown, the reinforcement learning is known to be effective to collect, store and utilize information returned from the environments. Without a supervisor, the method can construct criteria for evaluation of actions to achieve the objective. However, since the information received by a learning agent is obtained through an ...

متن کامل

Web pages ranking algorithm based on reinforcement learning and user feedback

The main challenge of a search engine is ranking web documents to provide the best response to a user`s query. Despite the huge number of the extracted results for user`s query, only a small number of the first results are examined by users; therefore, the insertion of the related results in the first ranks is of great importance. In this paper, a ranking algorithm based on the reinforcement le...

متن کامل

Sample Pages to Be Followed Exactly in Preparing Scripts Persistent Motion and Chaos in Attitude Control with Switching Actuators

In systems with switching actuators persistent motions of different nature may occur, such as limit cycles, quasi-periodic and chaotic motions. In this contribution the nature of persistent motions in an attitude control system with switching actuators subject to switching restrictions are examined as a function of controller parameters. Bifurcation diagrams are used to describe observations. I...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005